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1 Introduction 

We provide asymptotically sharp bounds for the Gaussian surface area and the 
Gaussian noise sensitivity of polynomial threshold functions. In particular we 
show that if / is a degree-d polynomial threshold function, then its Gaussian 
sensitivity at noise rate e is less than some quantity asymptotic to and the 
Gaussian surface area is at most -4=. Furthermore these bounds are asymp- 
totically tight as e — > and / the threshold function of a product of d distinct 
homogeneous linear functions. 

The noise sensitivity and surface area are both of fundamental interest and 
useful in the analysis of agnostic learning algorithms (see [6^). In particular 
our results imply that the class of degree-d polynomial threshold functions is 
agnostically learnable under the rt-dimensional Gaussian distribution in time 

A number of other authors have attempted to prove bounds along these lines. 
[7] proves a bound on noise sensitivity in terms of surface area and we relate 
our bounds essentially by also proving the other direction of this inequality for 
boolean functions with smooth interface that switch signs a bounded number 
of times on any line through the origin. Our bounds are obtained via a simple 
computation in the case of c? = 1. A bound of 0(e^/'-^'''') noise sensitivity was 
recently proved by 13^ and independently by [5j for multilinear polynomials. 

There is also interest in related questions for points picked uniformly from 
vertices of the hypercube rather than with the Gaussian distribution. It is 
conjectured in 4 that the corresponding noise sensitivity in this case is also 
always 0{dy^). The d — I case of this conjecture was proved by 8 , improving 
upon a bound of 0(e^/'*) of 2 . It is noted in j3] that such a result would imply 
a similar bound for the Gaussian case. Hence our results can be thought of as 
a first step toward proving this conjecture. 
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1.1 Basic Definitions 



Given a function / : M" { — 1, 1} we define the Gaussian noise sensitivity at 
noise rate e as 

GNS,(/) :=Pr(/(X)^/(Z)) 

where X is an n-dimensional Gaussian random variable, and Z — (I — e)X + 
y/2e — e^Y for Y an independent n-dimensional Gaussian. 

This is closely related to the Gaussian surface area of In particular 

we define the Gaussian surface area of a set A to be 

, ^, , . Gaussian Volume(A5\ A) 
T(A) := linimf . 

.5^0 5 

Where the Gaussian volume of a region R is Pr(X e R) for X a Gaussian 
random variable, and where As is the set of points x so that d{x, A) < 6 (under 
the Euclidean metric). We note that if A is an open region whose boundary is 
smooth away from codimension 2, that its Gaussian surface area is equal to 

/ 4>{x)da. 
J OA 

Where (j){x) is the Gaussian density, and da is the surface measure on dA. 
Furthermore if A is such a region, then its Gaussian surface area is seen to be 
equal to 

Gaussian Volume((i9A)5) 
Imi . 

.5^0 26 

For / a boolean function, we define 

r(/) :=r(/-i(i)). 

The concepts of noise sensitivity and surface area are related to each other 
by noting that the noise sensitivity is roughly the probability that X is close 
enough to the boundary that wiggling it will push it over the boundary. 



1.2 Statement of Results 

We focus on proving two main results. We define / to be a degree d polynomial 
threshold function if f{x) = sgn(p(a;)) for some degree d polynomial p. We 
prove the following Theorems about such functions: 

Theorem 1. If f is a degree d polynomial threshold function, then 

TT TT 

Furthermore this bound is asymptotically tight as e — > for the threshold func- 
tion of any product of distinct linear functions. 

Theorem 2. /// is a degree-d polynomial threshold function then T{f ) < 

Section 2 will be devoted to the proof of Theorem [1] Section 3 to the proof 
of Theorem [21 and Section 4 will provide some closing notes. 
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2 Proof of the Noise Sensitivity Bound 



Proof of Theorem[J\ We begin by letting 9 ~ arcsin(-\/2e — e^). We need to 
bound 

p GNS,(/) = Pr(/(X) ^ /(cos(0)X + sin(e)y)). (1) 

We note that the value of p given in Equation[l]remains the same if X and Y are 
replaced by any X' and Y' that are i.i.d. Gaussian distributions. In particular 
we define 

X^ = cos((/))X + sin(0)y. 

Note that and X0+7r/2 are i.i.d. Gaussians. Using these distributions we 
find that for any cf) that since 

Xg+^ = 008(6* + (P)X + sm{0 + (P)Y 

= cos{e) cos{(j))X - sm{e) sm{(j})X + cos(6') sm{(j))Y + sm{e) cos(0)r 
= cos(6')X0 + sin(6')X0+^/2, 

we have 

p - Pr(/(X0) ^ f{X^+e)). 
Therefore we have for any integer n that 

np = Pr(/(Xo) ^ f{Xe))+Fv{f{Xe) ^ f{X2e))+. ■ .+Pr(/(X(„_i)e) ^ /(^ns)). 

(2) 

We define the random function F : { — 1,1} by 

F{^) = f{X^). 

{F depends on X and Y as well as (j)). We note that the left hand side of 
Equation [2] is at most the number of times that F{(j)) changes signs on the 
interval [0, n9]. Therefore we have that 

^ E [number of times F changes signs on [0, n0]] 
~ n 

We note that F{(f>) is periodic in (p with period 27r. Therefore the number of 
times F changes sign on [0, n9] is the number of times that F changes sign on 
[0,27r) times (f^ + 0(1)). Applying this to Equation[31 we get that 

^ E[number of sign changes of F on [0, 27r)] + 0(1)) 
^ ~ n 

Taking a limit as n — > oo yields 

6^E [number of sign changes of F on [0, 2tt]] 
P- 2^ • 

We now make use of the fact that / is a degree d polynomial threshold 
function. In particular we will show that for any X and Y that F changes signs 
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at most 2d times on [0, 27r). We let / = sgn(sr) for some degree d polynomial 
g. We note that the number of sign changes of F is equal to the number of 
zeroes of the function g{cos{(j))X + sin{(j))Y) (unless this function is identically 
0, which happens with probability and can be ignored). It should be noted 
though that g{cos{(j))X + sin((/))y) = if and only if 2 = e"^ is a root of the 
degree-2cZ polynomial 



Therefore the expectation in Equation [4] is at most 2d. Therefore we have 

2d9 dO 

ZTT TT 

as desired. 

We also note the ways in which the above bound can fail to be tight. Firstly, 
there may be some probability that F changes signs less than 2c? times on a full 
circle. Secondly, the number of times F changes signs may be more than the 
fraction of the time that f{n9) ^ /((« + 1)6*) if sign changes are spaced more 
tightly than 0. On the other hand it should be noted that if / is the threshold 
function for a product of d distinct homogeneous linear functions, the first case 
happens with probability 0, and the probability of the second case occurring 
will necessarily go to as e does. Therefore for such functions our bound is 
asymptotically correct as e ^ 0. □ 



z + z^^ 



3 Proof of the Gaussian Surface Area Bounds 

We will first need to bound a slight variant of the noise sensitivity of a polyno- 
mial threshold function. We begin by proving the following Lemma: 

Lemma 3. If f is a degree d polynomial threshold function in n dimensions, 
e > and X a random Gaussian variable, then 

Pr{f{X)^f{X{l + e)))<de^. 

Proof. First note that by first conditioning on the line that X lies in we may 
reduce this problem to the case of a one dimensional distribution. Note that / 
changes sign at most d times along this line. We need to bound the probability 
that at least one of these sign changes is between X and (1 + e)X. It therefore 
suffices to prove that for any one of these sign changes, that it lies between 
X and (1 + e)X with probability at most e-y/^. Note that the probability 
that X is on the correct side of the origin is ^- Beyond that \X\'^ satisfies 
the distribution with n degrees of freedom, namely 2"/2r{n/2) x^^^^^^'^^^^dx. 
Letting y — log(a;) = 21og(|X|) we find that that y has distribution 

1 e"v/^e-^"^^dy. 



2"/2r(n/2) 
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We want the probability that y is within a particular range of size 21og(l + e). 
This is at most 2e times the densest part of the density function. This is achieved 
when ny — is maximal, or when y — log(n). Then the density is 



2"/2r(n/2) V 47r (n/2) 

Multiplying this by d, 2e and ^ (the probability that X is on the correct side 
of 0), we get our bound. Notice also that this bound should be nearly sharp 
if the polynomial giving / is a product of terms of the form \X\'^ — ri for n 
approximately n and spaced apart by factors of (1 + e)^. □ 

We can now prove a bound on a quantity more relevant to Gaussian surface 
area: 

Corollary 4. // f is an n dimensional, degree d polynomial threshold function, 
e > and X and Y independent Gaussians, then 

PrU{X)^f{X + eY))<^ + ^.[l. 

TT 4 V TT 



Proof. We let r = Vl + e^, d = arctan(e), and let Z = cos{9)X + sin(6')r be a 
normal random variable. Note that X + eV = rZ. Wc then have that 

Pr(/(X) ^ fix + eY)) < Pr(/(X) ^ f{Z)) + Pr(/(Z) ^ f{rZ)). 

By Theorem [T] and Lemma [3] this is at most 

dO , / n de de^ pn 

TT V 47r TT 4 y TT 

□ 

In particular, we relate this to Gaussian surface area by: 

Lemma 5. // / is a boolean function with /^^(l) open with smooth boundary 
and Gaussian area S , and if X and Y are independent Gaussians then, 

^.^ Pr(/(X) = -1 andfiX + eY) = l) ^ _S_ 

Proof First note that if ^ = f'^il), then 

Gaussian Volumcf AAA) 
lim = b 

rather than just the liminf being equal. Next note that since the probability 
that |e^| > e^^"^ goes rapidly to as e ^ 0, we can throw away all cases where 
X is not within e^^'^ of dA from the left hand side of Equation [5] When X is 
close to dA and when f{X) = —1, we may approximate the probability that 
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/(X + eY) = 1 by the probability that the component of eY in the direction 
of the shortest path from X to A is more than d{X, A) . Since dA is smooth, 
this approximation is accurate for X close to A, and in particular for X within 
e ' should introduce an error of 0(e'*/3), which can be ignored. Hence if Z 
is a normalized one variable Gaussian, the numerator of left hand side can be 
replaced by 

Pr(eZ > d{X, A)>0) = Pr (^Z > > q 

This is easily seen to be 

1 /"OO 1 

= e-='''^^PriO < d{X,A) < ex)dx = / -^e-''''^^GYol{A,^\A)dx 

-^e-^'/^Sexil + o{l))dx 
V 27r 

Se . , 
= —=+o{e). 

Thus completing our proof. □ 

Proof of Theorem This follows immediately from Corollary |4] and Lemma [5] 
after noting that 

Pr(/(X) = -1, f{X + ,Y) = 1) ^ ipr(/(X) ^ f{X + eY)) ^ ^. 

Z Zn 

□ 



4 Conclusion 

We have shown nearly tight bounds on the Gaussian surface area and noise 
sensitivity of polynomial threshold functions. One might hope to generalize 
these results to work for other distributions, such as the uniform distribution 
on vertices of the hypercube. Unfortunately, several aspects of this proof are 
difficult to generalize. Perhaps most significantly, we lose the symmetry that 
allowed us to prove our original result on noise sensitivity. Another difficulty 
would be in the relation between noise sensitivity and surface area. In our case 
the two are essentially equivalent quantities of study. On the other hand [5] 
defined a notion of surface area for the hypercube distribution and proved that 
for even linear threshold functions there could be a gap between noise sensitivity 
and surface area of as much as O ( ^log(n)) . 
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